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(54) METHOD FOR COMPRESSING AND EXPANDING VOICE TIME-BASE 
(57) Abstract: 

PURPOSE: To develop a voice compression and expansion system by which a voice waveform 
small in displacement and distortion can be obtained even when the compression/expansion ratio 
is close to 1. 

CONSTITUTION: When a voice signal is compressed, voice signal waveforms F and G whose 
lengths are >2 times as large a pitch cycles and integral multiples are cut from both the position 
PI of the current pointer of the inputted voice signal waveform and the position P2 which is one 
pitch cycle precedent to the pointer, and a couple of cut voice signal waveforms are weighted with 
proper Wl and W2 and added to generate one signal waveform E by compressing the two voice 
signal waveforms, and, the current pointer PI is moved on the inputted voice signal waveform 
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corresponding to compressibility to obtain a pointer P4, and similar operation is performed to 
obtain a compressed voice signal. 
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CLAIMS 



[Claim(s)] 

[Claim l] Are said twice [ more than ] pitch period, respectively with both the location of the 
present pointer of a sound signal wave inputted at the time of compression of a sound signal, and 
the location to which only 1 pitch period went to this pointer, and the sound signal wave of the 
die length of an integral multiple is started. Give suitable weight, add the sound signal wave of 
the started pair, and one signal wave form which compressed said two sound signal waves is 
generated. Next, the voice time-base-compaction approach characterized by acquiring the sound 
signal compressed by making said current pointer move on said inputted sound signal wave 
according to compressibility, and performing same actuation. 

[Claim 2] Are said twice [ more than ] pitch period, respectively with both the location of the 
present pointer of a sound signal wave inputted at the time of expanding of a sound signal, and 
the location in which only 1 pitch period was to this pointer, and the sound signal wave of the die 
length of an integral multiple is started. Give suitable weight, add the sound signal wave of the 
started pair, and one signal wave form which compounded said two sound signal waves is 
generated. Next, the voice time-axis expanding approach characterized by acquiring the sound 
signal elongated by making said current pointer move on said inputted sound signal wave 
according to the rate of expanding, and performing same actuation. 



DETAILED DESCRIPTION 



[Detailed Description of the Invention] 
[0001] 

[Industrial Application] This invention relates to the audio time base compaction or the audio 
expanding approach which mainly encodes the digital sound signal used for the utterance 
speed-conversion approach, a voice IC recorder, etc. of the voice which controls the utterance rate 
in the case of processing voice in broadcast, a movie, and music. 
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[0002] 

[Description of the Prior Art] A conventional digital sound signal is made into time base 
compaction or the approach of elongating. IEEE Trans.Acoust.Speech, Signal Processing, 
vol.ASSP-27, pp. 121-133, and Apr. ? 79"Time-domain algorithms for harmonic bandwidth 
reduction and time scaling of speech signals" TDHS (Tme-Domain Harmonic Scaling) 
indicated by D.Malah work and the collection of the flexible method in the time-axis of the voice 
by the correlation method, and "evaluation" Institute of Electronics and Communication 
Engineers spring lecture drafts - pp.9- 16 and 1986.5.21 Naotaka Morita " PICOLA (Pointer 
Interval Control OverLap and Add) indicated by Itakura ****** is advocated. 
[0003] It is die length Nc and [0004] from P2, respectively the time of only the pitch period T 
progressing to the present pointer location PI and this, as shown in the wave form chart of 
drawing 7 (a) when setting to T the pitch period beforehand extracted using the pitch extractor 
to the voice wave in said TDHS and setting compressibility (or rate of expanding) to r. 
[Equation l] 
N c = iT / (1 - r) 

[0005] It is die length Nc by starting ****** wave A and B, giving the weight (windowing 
function shown by the drawing destructive line F2) which goes conversely to 1 the weight 
(windowing function shown by the drawing destructive line Fl) which goes to 1-0 linearly 
linearly from 0 at one wave B to Wave A, and adding both. Wave C is acquired. Next, it is a 
pointer on said input wave ( drawing 7 (a)) [0006] 

[Equation 2] 

N c + T = T / (1 - r) 

[0007] It is the following die length Nc by carrying out **** migration, starting two waves of the 
pitch period T like the above, giving weight to these and adding to them. A wave is acquired. It is 
the input wave of die-length Nc+T to the die length Nc by repeating this actuation henceforth. 
An output wave is made continuously. And this die length Nc The wave is what satisfied 
compressibility r to the input wave of die-length Nc+T. 

[0008] It is die length Ns and [0009] from P4, respectively the time of on the other hand, only the 
pitch period T being overdue to the present pointer location Pi and this in expanding, as shown 
in the wave form chart of drawing 8 (a) and (b). 
[Equation 3] 
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N s - iT / (r - 1) 



[0010] It is die length Ns by starting ****** wave A and B, giving the weight (windowing 

function shown by the drawing destructive line F4) which goes conversely to 0 the weight 

(windowing function shown by the drawing destructive line F3) which goes to 0-1 linearly 

linearly from 1 at one wave B to Wave A, and adding both. Wave C is acquired. Next, it is a 

pointer on said input wave ( drawing 8 (a)) [00 11] 

[Equation 4] 

N g - T = T / (r - 1) 

[0012] It is the following die length Ns by carrying out **** migration, starting two waves of the 
pitch period T like the above, attaching weight to these and adding to them. A wave is acquired. 
The output wave of the rate r of expanding is continuously acquired to the input wave of 
die "length Ns-T by repeating this actuation henceforth. And this die length Ns The wave is what 
satisfied the rate r of expanding to the input wave of die-length Ns-T. 

[0013] By the way, if this TDHS is applied to compression or expanding of a time-axis, 
compressibility or the rate r of expanding is the wave length Nc as mentioned above, when close 
to 1. Ns It became larger than the pitch period T too much, the wave-like gap by fluctuation of a 
pitch period arose within the voice wave to add, and there was a trouble of becoming the cause of 
distortion. 
[0014] 

[Problem(s) to be Solved by the Invention] When TDHS is applied to compression or expanding of 
a time-axis as mentioned above, compressibility (or rate of expanding) r this invention When 
close to 1, Nc </SUB> Ns The wave-like gap by fluctuation of a pitch period arises within the 
voice wave added more greatly than T. In view of the conventional trouble which is said when it 
comes to the cause of distortion, even when compressibility (or rate of expanding) is close to 1, it 
aims at developing the speech compression and the expanding method with which the small 
voice wave of a gap or distortion is acquired. 
[0015] 

[Means for Solving the Problem] This invention starts the sound signal wave of the die length of 
the more than twice and the integral multiple of said pitch period, respectively from both the 
location of the present pointer of a sound signal wave inputted at the time of compression of a 
sound signal, and the location to which only 1 pitch period went to this pointer. Give suitable 
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weight, add the sound signal wave of the started pair, and one signal wave form which 
compressed said two sound signal waves is generated. Next, said current pointer is made to move 
on said inputted sound signal wave or the generated sound signal according to compressibility, 
and the sound signal compressed by performing same actuation is acquired. 
[0016] Moreover, this invention is said twice [ more than ] pitch period, respectively with both 
the location of the present pointer of a sound signal wave inputted at the time of expanding of a 
sound signal, and the location in which only 1 pitch period was to this pointer, and starts the 
sound signal wave of the die length of an integral multiple. Give suitable weight, add the sound 
signal wave of the started pair, and one signal wave form which compounded said two sound 
signal waves is generated. Next, said current pointer is made to move on said inputted sound 
signal wave or the generated sound signal according to the rate of expanding, and the sound 
signal elongated by performing same actuation is acquired. 
[0017] 

[Function] The die length of a voice wave which is applied according to the above-mentioned 
configuration is concerned with compressibility or the rate of expanding, and the distortion of the 
sound signal acquired as the waves which were twice [ more than ] the pitch period that there is 
nothing, and were the die length of an integral multiple, and were always most alike since 
adjustment of compressibility or the rate of expanding was performed by the movement 
magnitude of a pointer were added and compressibility or the rate of expanding approached 1 
becomes small. 
[0018] 

[Example] The voice time-base-compaction approach and the expanding approach of this 

invention are explained to a detail based on a drawing about the one example below. 

The Fig. corresponding to the [compression approach] shows the circuit block diagram with 

which drawing 1 - drawing 3 drawing 1 realizes the compression approach concerned. 

[0019] Analog sound signal s (t) is changed into digitized voice data s (n) by AID converter 1 in 

drawing 1 . Next, the pitch period T of digitized voice data s (n) is extracted by the pitch extractor 

2, time base compaction of voice data s (n) is performed by the time-base-compaction machine 3 

using the extracted pitch information, and the digitized voice signal sc (n) compressed as a result 

is acquired. 

[0020] And said pitch period information T and the compressed sound signal sc (n) are changed 

into D/A converter 4 delivery and here at analog sound signal s' (t), and is outputted. 

[0021] Detailed work of the time-axis compressor 3 in said drawing 1 is explained using drawing 
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2 (a), (b), and drawing 3 (a) and (b) below. 

[0022] As said time-axis compressor 3 shows to drawing 2 (a), the pointer PI of the present 
respectively, on the other hand, the time of only the pitch period T progressing from P2 - 
respectively - the voice wave F of die-length 2T (the voice wave A+ voice wave B ■-) namely, the 
section of pointers P1-P3 and the voice wave G (the voice wave B+ voice wave C ■-) That is, the 
section of pointers P2-P4 is started, the weight (windowing function Wl) which goes to TO is 
given to Wave F, the weight (windowing function W2) which goes to 1 from 0 conversely is given 
to Wave G, both [ these ] waves are added, and the voice wave E of die-length 2T is acquired 
(refer to drawing 2 (b)). In addition, the weight Wl and W2 here is formed in order to maintain 
the wave-like continuity in the node before and behind the voice wave E. 
[0023] Next, Lc of drawing 2 [0024] 
[Equation 5] 
L c - iT / (1 - r) 

[0025] the time of carrying out - the time of Lc >=2T - the input voice top of said drawing 2 (a) - 
T+Lc only -- a pointer is moved and the actuation same as a pointer P5 as the above is performed. 
The output at this time will output said wave E and the input waves of die-length Lc-2T from P4 
to P5 following this as they are. 

[0026] the input voice top which shows drawing 3 (a) on the other hand at the time of Lc <2T a 
pointer PI to T+Lc only ■■ a pointer is moved to the right and the actuation same as a pointer P6 
as drawing 2 (a) is performed. The output at this time is the die length Lc in the first half of 
Wave E. It becomes only a part and they are remaining 2 T-Lc. Since a part is again used for the 
next processing, it is returned to an input side. 

[0027] It is die-length T+Lc by the above actuation. An input wave to die length Lc The output 
wave was made and this has satisfied the compressibility r of a sound signal. In addition, 
compressibility r=3/4, drawing 3 (a), and (b) show the case of compressibility r=3/5, respectively, 
and, as for drawing 2 (a) and (b), the wave length which starts a compressional-wave form shows 
the case of the twice of a pitch period for both sides. 

[0028] It cannot be overemphasized that it is not what is restricted to this example that the wave 

started, of course should just be a twice [ more than ] as many integral multiple as this. 

The Fig. corresponding to the [expanding approach] shows the circuit block diagram with which 

drawing 4 • drawing 6 drawing 4 realizes the expanding approach concerned. 

[0029] Analog sound signal s (t) is changed into digitized voice data s (n) by A/D converter 1 in 
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drawing 4 . Next, the pitch period T of digitized voice data s (n) is extracted by the pitch extractor 
2, time-axis expanding of voice data s (n) is performed by the time-axis expandor 5 using the 
extracted pitch information, and the digitized voice signal se (n) elongated as a result is acquired. 
[0030] And said pitch period information T and the elongated sound signal se (n) are changed 
into D/A converter 4 delivery and here at analog sound signal s' (t), and is outputted. 
[0031] Detailed work of the time-axis expandor 5 in said drawing 4 is explained using drawing 5 
(a), (b), and drawing 6 (a) and (b) below. 

[0032] As said time-axis expandor 5 shows to drawing 5 (a), the present pointer PI, the time of 
only the pitch period T being overdue to this pointer PI - from P4 - respectively - the voice 
wave F of die-length 2T (the wave A+ wave B -) namely, the section of pointers P4-P2 and the 
voice wave G (the wave B+ wave C ■•) That is, the section of pointers P1-P3 is started, and the 
voice wave H of die-length 2T which gives the weight W4 which tends toward Wave G weight W3 
which goes to 0-1 1*0 to Wave F, and is shown in drawing 5 (b) in addition is created. Weight W3 
at this time and W4 are prepared in order to keep the continuity in the node before and behind 
the voice wave H being the same as that of the case of said time-axis compressor 3. 
[0033] And Ls [0034] 
[Equation 6] 
Lg = T / (r - 1) 

[0035] the time of carrying out - the time of Ls >=T - the input voice top of said drawing 5 (a) - 
Ls only - it is made to move and actuation same as a pointer P3 is performed. The output at this 
time outputs Wave H and the input wave (wave C) of die-length Ls-T to the pointers P2-P3 
following this as it is. on the other hand ■- the time of Ls <T - the input voice top of drawing 6 (a) 
- a pointer PI to Ls only - a pointer is moved and the actuation same as a pointer P5 as the 
above is performed. The output at this time is die-length T+Ls in the first half of Wave H. It 
becomes only a part and is remaining T-Ls. Since a part is again used for the next processing, it 
is returned to an input side. 

[0036] It is die length Ls by the above actuation. An input wave to die-length T+Ls The output 
wave is made and it turns out that the rate r of expanding is satisfied. In addition, drawing 5 (a) 
and (b) show the case of rate r=of expanding 3/2, and the wave length which starts when both 
drawing 6 (a) and (b) make an expanding wave from the case of the rate r= 3 of expanding shows 
the case of the twice of a pitch period. It cannot be overemphasized that it is not what is 
restricted to this example that the wave started, of course should just be a twice [ more than ] as 
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many integral multiple as this. 
[0037] 

[Effect of the Invention] The die length of a voice wave applied at the time of compression or 
expanding is concerned with compressibility or the rate of expanding, it is the die length of a 
twice [ more than ] as many integral multiple as a pitch period that there is nothing, and since 
compression or adjustment of expanding is performed by the movement magnitude of a pointer, 
the wave-like gap by fluctuation of a pitch period is suppressed, and, according to the 
time-base-compaction approach of this invention, and the expanding approach, it can expect the 
effectiveness that the distortion of a voice wave at the time of compression or expanding becomes 
small as a result. 

[0038] Moreover, the effectiveness that distortion becomes small is also expectable as 
compressibility or the rate of expanding approaches 1 as compared with the conventional 
approach. 



DESCRIPTION OF DRAWINGS 



[Brief Description of the Drawings] 

[Drawing l] It is the block circuit diagram which realizes the time-base -compaction approach of 
this invention. 

[Drawing 2] It is drawing showing input wave (a) at the time of compression by this invention 
approach (LC >=2T), and output wave (b). 

[Drawing 3] It is drawing showing input wave (a) at the time of compression by this invention 
approach (Lc <2T), and output wave (b). 

[Drawing 4] It is the block circuit diagram which realizes the time-axis expanding approach of 
this invention. 
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[Drawing 5] It is drawing showing input wave (a) at the time of expanding by this invention 
approach (Ls >=T), and output wave (b). 

[Drawing 61 It is drawing showing input wave (a) at the time of expanding by this invention 
approach (Ls <T), and output wave (b). 

[Drawing 71 It is drawing of the conventional approach corresponding to drawing 2 and drawing 
3 . 

[Drawing 81 It is drawing of the conventional approach corresponding to drawing 5 and drawing 
6 . 

[Description of Notations] 

1 A/D Converter 

2 Pitch Extractor 

3 Time-axis Compressor 

4 D/A Converter 

5 Time -axis Expandor 
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